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@ Cloning vector plasmid, vector-primer derived therefrom and preparation method of cDNA bank 
using the same. 



@ Disclosed is a cloning vector plasmid suitable for preparing cDNA banks. The cloning vector plasmid of the 
present invention comprises, in the order mentioned from upstream to downstream region unless otherwise 
specified, a promoter PR1 acting In mammalian cells; an unique restriction enzyme site RE1 rarely existing in 
eukaryotic DNA and a promoter PR2 of RNA polymerase; an unique restriction enzyme site RE2 generating 3 - 
protruding dG tail; an unique arbitrary restriction enzyme site RE3: an unique arbitrary restriction enzyme site 
RE4 generating s'-protruding terminus; an unique restriction enzyme site RE5 rarely existing in eukaryotic DNA 
and a promoter PR3 of RNA polymerase; and at least one kind of arbitrary restriction enzyme site RE6 
generating s'-protruding terminus existing at the arbitrary position before the restriction site RE1; and further, at 

N arbitrary positions other than the above-described restriction enzyme sites and promoters, an origin 0R1 for 

^ replication in mammalian cells, an origin 0R2 for replication of a single-stranded phage, an origin 0R3 for 

IX> replication in E. coli , and a selective marker, 
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CLONING VECTOR PLASMID, VECTOR-PRIMER DERIVED THEREFROM AND PREPARATION METHOD OF 

CDNA BANK USING THE SAME 



The present invention relates to a multi-functional cloning vector, a vector-primer derived therefrom, and 
a method to prepare cDNA bank using the same. This invention provides a method to obtain readily a 
cDNA which can be used to produce a large amount of protein which is available as a medicine. 

Proteins constituting ceils in our body play an central role to maintain our life as a skeleton of cell, a 

5 catalyst of reaction, and a mediator of signal transduction. Thus finding new proteins and elucidating those 
structure and function are important not only to know life but also to use them as a medicine or a reagent 
for diagnosis. Although there have been many trials to use these proteins functioning in our body as a 
medicine, few proteins have been available because of difficulty of preparing a large amount of protein. The 
recent progress of genetic engineering made it easy to find a gene encoding an useful protein from human 

70 cells and to express it in bacteria or animal cells in high yields. Using this technique recently many proteins 
were developed as a medicine. 

Any human protein has a potential availability as a medicine because it functions in our body. It has 
been known that many genetic diseases are caused by disappearance of a protein playing an important role 
in a living cell. Therefore to produce a large amount of protein and to elucidate its function- and the 

75 relationship to the disease are an important step for developing new medicine. If the investigation is begun 
from the purification of a protein showing a target activity, it takes a long time and much labor to obtain a 
large amount of starting material. To avoid this problem, the inventors take a strategy to approach from a 
gene by which an information of amino acid sequence of protein is encoded. In our plan, cDNA 
(complementary DNA) is synthesized from all mRNA which is transcribed from genome and is translated to 

20 protein, and then each cDNA is characterized on the encoding protein. 

The number of genes on human genome is estimated to be approximately 10^, which may correspond 
to the number of human proteins. Recently the determination of whole sequence of human genome is 
discussed world-widely, which is called the Human Genome Project. If this plan is realized, the amino acid 
sequence of all proteins is expected to be elucidated. It is, however, difficult to determine the amino acid 

26 sequence only from the nucleotide sequence of genome because the genome consists of exons encoding a 
part of protein and introns not encoding protein. In the cell, mRNA transcribed from the genome is 
processed by deletion of intron and the processed mRNA containing only exons is transcribed to protein on 
ribosome. Therefore if we determine the nucleotide sequence of the processed mRNA, we can obtain the 
information of the amino acid sequence of protein encoded by mRNA. The progress of the gene 

30 recombination technique enabled us to convert mRNA to cDNA and to introduce it into E.coli According to 
this technique we can synthesize a "cDNA vector" which contains cDNA derived from one species of 
mRNA. and construct the "cDNA library" which is a group of E,coll containing cDNA vectors. If we 
determine the nucleotide sequence of all cDNAS contained in the cDNA library, which corresponds to the 
nucleotide sequences of all mRNA existing in a cell, we obtain the amino acid sequence of ail proteins 

35 synthesized in a cell. We call the group of sequence-detemnined CDNAs the "cDNA bank". If the obtained 
cDNA vector is introduced and expressed in adequate animal cells, the expression product may be used for 
an assay screening. Taking above strategy we can construct the cDNA bank of human proteins, which is 
called the "homo-protein" cDNA bank, in which the amino acid sequence of each cDNA has been 
detennined and each cDNA can be expressed in mammalian cells. Here we describe only the case of 

40 human protein as an example, but this strategy can apply to the case of the other animal and plant cells, 
and we can obtain also an useful infonnnation about proteins of these cells. 

The success of above strategy depends on the quality of cDNA library. From the point of this view, 
there are many problems in the cDNA library prepared by known methods, for example, that subcloning of 
a cDNA fragment to an adequate vector is required to sequence the cDNA. to screen a full-length cDNA 

45 using it as a probe, or to express it in animal cells. It takes a long time and much labor to analyze many 
cDNA clones by this subcloning procedure. This problem is solved by developing a novel method by which 
above all procedure can be carried out on the same vector. The minimum requirements for a cDNA vector 
satisfying above method is , 1) to contain a directional full-length cDNA clone. 2) to be sequenced easily, 3) 
to be used to prepare a probe which can be used for various screening. 4) to be expressed in in vivo or In 

60 vitro systems. The term "full-length cDNA clone" described in the requirement 1) means the cDNA clone 
containing at least whole coding region of protein encoded by mRNA in this specification, but strictly the 
cDNA clone containing the whole sequence of a template mRNA from S end to poly A tail. 

The most popular method to prepare cDNA at present Is so called the Qubler-Hoffman method 
described below [Gene 25:263-269 (1983)]. The first strand of cDNA is synthesized from poly(A)*RNA 



EP 0 426 455 A2 



template isolated from cells and an oligo dT primer using reverse transcriptase. Then the RNA strand is 
replaced by the second DNA strand using Eco// RNase H, E.coll DNA polymerase I, and E.coli DNA ligase. 
After blunting both ends of doubie-stranded cDNA by T4 DNA polymerase, an adequate oligonucleotide 
linker DNA is added. Then the resulting cDNA is ligated to a phage vector or a plasmid vector, and used to 
s transform £co//. It is difficult to obtain a full-length cDNA using this method, because when synthesizing a 
second strand S terminal sequence of mRNA used as a primer is deleted and both terminals of cDNA are 
often deleted by exonuclease activity of DNA polymerase I or T4 DNA polymerase. The vector containing 
an origin and a promoter for expression in mammalian cells, a replication origin of a single-stranded phage, 
and an RNA polymerase promoter, for example, such as pCDM8 [B.Seed, Nature 329:840-842(1987)] is 
70 known. Using this vector, preparation of single-stranded DNA for sequencing, preparation of RNA probe, 
synthesis of mRNA for fn vitro or In vivo translation, and expression in mammalian cell are possible, but 
the cDNA insertion is not directional. Thus the cDNA library prepared using the Gubler-Hoffman method 
dose not satisfy the requirement 1). Furthermore a large size of pCDMS which is 4.8 kbp long is not 
suitable to clone a long size of cDNA. 
15 The Okayama-Berg method [IVIoI.Cell.Biol. 2:161-170(1982)) is known as a method giving full-length 
cDNA clone at high frequency, in which the dt-tailed vector-primer is used for synthesis of the first strand 
cDNA from poly(A)*RNA by reverse transcriptase. Following addition of dc tail, the dG-tailed linker DNA is 
iigated and then RNA is replaced to DNA by E.coii RNaseH, Eco// DNA polymerase I. and Eco// DNA 
ligase. Since the dC tail addition occurs rarely at the 3' end of first strand cDNA extended incompletely, the 
20 dC-tailed clone gives a fulMength cDNA preferentially. Using a vector-primer furthermore causes the 
directional insertion of cDNA. Okayama et al. [Mol.cell.Biol. 3:280-289(1983)] have developed the expres- 
sion system in mammalian celis by using linker DNA containing an origin and a promoter of SV40. Honjo 
has developed the expression system in Xenopus oocyte of mRNA transcribed in vitro by using linker 
DNA containing SP6 promoter [Japanese Patent 62-74291]. However the use of Okayama-Berg method Is 
25 limited because of requiring a high quality of technique. The large number of dG tails at the 5 terminus 
make it often difficult to sequence from 5 terminus. It is further difficult to prepare probes by this method. 
Thus the cDNA library using this method dose not satisfy the requirements 2) and 3) described above. 

Although there have been several improved methods, no methods satisfy all of four requirements 
described above. Since even the lack of one requirement makes it difficult to carry out above strategy, the 
30 development of method satisfying all above requirements is necessary. 

In view of the above, it is desirable to provide a method to construct a cDNA vector which satisfies the 
following four requirements, 1) to contain the directional full-length cDNA clone. 2) to be easily sequenced. 
3) to be used to prepare an RNA probe which is complementary to a full-length cDNA and is used for 
screening. 4) to be able to express in a mammalian cell. A plasmid vector and a vector-primer derived from 
36 it, are desirable, for carrying out the above method. 

The inventors succeeded in constructing a novel vector for cDNA cloning and in constructing a human 
cDNA bank using it The invention provides a cloning vector which comprises a promoter PR1 acting in 
mammalian cells, followed by an unique restriction enzyme site RE1 rarely existing in eukaryotic DNA and 
a promoter PR2 of RNA polymerase, followed by an unique restriction enzyme site RE2 generating 3 - 
40 protruding dG tail, followed by an unique arbitrary restriction enzyme site RES, followed by an unique 
arbitrary restriction enzyme site RE4 generating 3'-protruding terminus, followed by an unique restriction 
enzyme site RE5 rarely existing in eukaryotic DNA and a promoter PR3 of RNA polymerase, and at least 
one kind of arbitrary restriction enzyme site RE6 generating 3'-protruding terminus existing at the arbitrary 
position before the restriction site RE1, and further, at the arbitrary position except above regions, an origin 
46 0R1 for replication in mammalian cells, an origin 0R2 for replication of a single-stranded phage, an origin 
0R3 for replication in Eco//. and a selective marker. 

The invention further provides a vector-primer which is prepared from a cloning vector piasmid 
described above. After cutting a cloning vector at a restriction site RE4, the dT-tail is added by terminal 
deoxynucleotidyl transferase and then cut at the restriction site RE3, 
60 The invention further provides a method to prepare a cDNA bank comprising cDNA vectors which 
contains sequence-determined cDNA insert and can be expressed in mammalian cells. This method 
includes synthesizing GDNAs from poly(A)*RNA isolated from cell using vector-primer described above, 
transforming Eco// with ^them. preparing a cDNA plaque library by infection of a helper phage, and 
sequencing cDNA from S terminal end using a single-stranded DNA isolated from a single-stranded phage 
65 plaque. 

The invention further provides a method to prepare a library-specific cDNA bank including cDNA 
vectors which contains sequence-determined cDNA insert and can be expressed in mammalian cells. This 
method comprises preparing two kinds of cDNA library LA and LB using above method, preparing sense 
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RNA probes from the library LA using RNA polymerase, preparing a group of anti-sense single-stranded 
cDNAS from the library LB. obtaining the library LA - specific cDIMA clones by screening the library LA 
using probes which contains a hybridization mixture between RNA probes and single-stranded cDNAS 
prepared above, and sequencing them from 5 terminus. 

5 Using techniques embodying the invention we may clone a full-length cDNA without a complicated 
procedure such as subcloning, detennine Its nucleotide sequence, prepare a complementary probe for 
screening, and express it in mammalian cells. Therefore this cloning vector is a powerful tool especially to 
prepare a cDNA bank originated from various mRNAs. The "cDNA bank" In this specification means the 
cDNA library in which the nucleotide sequence of each cDNA clone has been already determined. It may 

10 be possible to determine the amino acid sequence of all proteins composed of human or any other animals* 
cell In shorter period than using conventional methods. 



BRIEF DESCRIPTION OF THE DRAWINGS 



75 



Figure 1 shows the function of a cDNA vector according to an embodiment of this invention; 

Figure 2 shows the arrangement of promoters in a cloning vector according to an embodiment of this 

invention; 

20 Rgure 3 shows the restriction map of the vector pKA1; 

Figure 4 shows a process to synthesize cDNA using a vector plasmid; 

Figure 5 shows the subtraction method using a cDNA system according to an embodiment of this 
invention; 

Figures 6 to 8 show the procedures to construct vectors according to embodiments of this invention; 
25 Figure 9 shows the method to prepare a vector-primer according to an embodiment of this invention; 

Figure 10 shows the results of plaque hybridization and subtraction using the protocol according to an 
embodiment of this invention. 
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DETAILED DESCRIPTION 



Rgure 1 shows the functions of a cDNA vector embodying the invention. The construction method of a 
cloning vector, the preparation method of a cDNA vector using It, and examples showing that this cDNA 
vector satisfies above four requirements will now be described. 



(1) Cloning Vector 

The most characteristic point in this invention is the cloning vector used for cDNA synthesis. The 

40 vector-primer contains the full-length cDNA clone and Is preferably endowed with the multi-functional 
properties to satisfy the requirements 2), 3) and 4). As described above, the cloning vector included in this 
invention comprises a promoter PR1 acting in mammalian cells, followed by an unique restriction enzyme 
site RE1 rarely existing in eukaryotic DNA and a promoter PR2 of RNA polymerase, followed by an unique 
restriction enzyme site RE2 generating s'-protruding dG tail, followed by an unique arbitrary restriction 

46 enzyme site RE3, followed by an unique arbitrary restriction enzyme site RE4 generating s'-protruding 
terminus, followed by an unique restriction enzyme site RES rarely existing in eukaryotic DNA and a 
promoter PR3 of RNA polymerase, and at least one kind of arbitrary restriction enzyme site RE6 generating 
s'-protoiding terminus existing at the arbitrary position before the restriction site RE1 , and further, at the 
arbitrary position except above region, an origin 0R1 for replication in mammalian cells, an origin 0R2 for 

60 replication of a single-stranded phage, an origin 0R3 for replication in E.coii. and a selectable marker. 

The term " unique" in the above description means that the restriction site appears only once on the 
plasmid. Figure 2 shows the arrangement of promoters PR1 , PR2, and PR3. The change of the order of 
RE1 and PR2 as well as the order of RES and PR3 is allowable. It is desirable that no extra nucleotide 
sequence exists between the restriction site and promoter. The restriction sites RE2, RE3, and RE4 are 

55 necessary to satisfy the requirement 1). 0R2 and RE6 to satisfy the requirement 2), PR2, PR3, RE1, RES, 
and 0R2 to satisfy the requirement 3), and 0R1 and PR1 to satisfy the requirement 4). 

A promoter for expression in mammalian cells. PR1, includes promoters such as an early or late 
promoter of SV40, a late promoter of adenovirus 2, and a 5 long terminal repeat (LTR) of retrovirus. An 
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unique restriction enzyme site rarely existing in euI<aryotic DNA, RE1 and RES. includes restriction sites 
such as Notl. SnaBI, Sfil, and Spel. An unique restriction enzyme site producing 3 -protruding dG oligomer 
tails after digestion. RE2. includes restriction sites such as BstXt. Sfil, Dralll. and Bgll. Since the nucleotide 
sequence of cut site In these restriction enzyme sites can be replaced by any nucleotide pairs. If dG-dC 
pair as an nucleotide pair is used in them, cutting by these restriction enzymes produces a 3 -protruding dG 
terminus. For example, the nucleotide sequence of restriction site for BstXI is shown as beiow: 
S -CCANNNNNNTGG-3^ 
3' -GGTNNNNNNACC-S 

If replacing N in the top sequence by G, digestion of this site with BstXI produces the following terminal 

sequence. 

5'-CCAQQQGG-3' 

3'-GGTC-5' 

Thus RE2 includes any l<lnd of restriction sites which can produce a 3 -protruding terminus after restriction 
enzyme digestion, and so is not limited by the above examples. 

RE6, which comprises at least one kind of arbitrary restriction enzyme site producing 3 -protruding 
terminal existing at the arbitrary position before the restriction site RE1, includes any restriction site 
producing a 3'-protrudlng terminus as far as it is different from RE1, RE2, RE3. RE4, and RES, and 
preferably exists rarely in eukaryotic DNA. One example is Sfil. but there are many other examples. It is 
desirable that RE6 contain more than two different restriction sites. It is not necessary that RE6 is an unique 
site but that exists at the upstream of RE1 . 

PR2 and PR3 for an RNA polymerase promoter includes promoters of polymerases such as T3 RNA 
polymerase. T7 RNA polymerase, and SP6 RNA polymerase. 0R1 for a replication origin in mammalian 
cells Includes an origin of SV40. for example. 0R2 for a replication origin of a single-stranded phage 
Includes origins of f1 phage or M13 phage. 0R3 for a replication origin of E.coli includes origins of 
plasmids such as pBR322 or pUC series. A selective marker includes drug-resistant markers such as those 
for amplcillin, tetracyclin, or kanamycin, and the other markers such as nutrient requirement or temperature 
sensitivity. Although a neomycin-resistant gene can be Introduced to this vector as a drug-resistant marker 
for a mammalian cell system, this may cause a disadvantage that the size of vector becomes too long. It is 
further desirable to include a splicing junction and poly A addition signal sequence for high expression in 
mammalian cells. 

A cDNA Is Introduced in the directional manner between the restriction sites RE2 and RE4, in which 5 - 
terminal of mRNA is adjacent to RE2. The direction of a replication origin of a single-stranded phage 
determines which strand of double strands in vector plasmid is produced as a single strand. In this vector 
system, the single-stranded DNA consists of an anti-sense strand for cDNA. In this specification, the strand 
possessing the same sequence as template mRNA except U replaced by T is defined as a sense strand, 
and the complementary sequence as an anti-sense strand. 

To determine the nucleotide sequence from s' terminal of cDNA. any synthetic oligonucleotide which 
anneals at the upstream of RE2 can be used as a sequencing primer. If the vector is designed to have the 
nucleotide sequences of primers used for sequencing of M13, these primers can be used for sequencing of 
a cDNA insert. 

An RNA polymerase promoter PR2 is introduced into the vector in the direction as producing a single- 
stranded RNA of sense strand for mRNA template. In the same manner, PR3 is introduced to produce an 
anti-sense strand. The plasmid described above can be readily constructed using known plasmids and 
synthetic oligonucleotides. The plasmid pKA1 described below as an example uses an early promoter of 
SV40 for PR1. SnaBI for RE1, T7 promoter for PR2, BstXI for RE2, EcoRV for RE3, Kpnl for RE4, Notl for 
RES, a T3 promoter for PR3. Sfil. PstI, and SphI for RE6. an origin of SV40 for 0R1, an origin of phage f1 
for 0R2, an origin of pBR322 for GR3, an ampiclllin resistant marker ()3-lactamase gene) for a drug-resistant 
marker, and the nucleotide sequences of universal and reverse primer of Ml 3 phage for the priming site 
where a sequencing primer anneals, respectively. pKA1 further Includes a 16S splicing site at the 
downstream of a SV40 promoter and a poly A addition signal sequence at the downstream of a T3 
promoter. The size of pKAl is 3.6 kbp and is shorter than known multi-functional cloning vectors, that 
makes it easy to get longer cDNA clones. 

The plasmid described above is cut at the restriction site RE4 to produce a 3 -protruding terminal, and 
then dT tail is added to the terminal using terminal deoxynucleotityl transferase. The desirable number of 
added dT is from 30 to 70. preferably from 50 to 60. Cutting at the restriction site RE3, the longer DNA 
fragment containing a vector is isolated with agarose gel electrophoresis and is used as a vector-primer. 
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(2) Preparation method of cDNA library 

PoIy(A)*RNA isolated from cells using a known method Is annealed with a vector-primer described 
above, and then reverse transcriptase is used to synthesize the first strand of cDNA. The dC tail, the 

5 number of which is preferably from 4 to 10, is added to the end of the first strand cDNA using tenminal 
deoxynucleotidyl transferase. After cutting at the restriction site RE2, self-ligation Is carried out using E,coli 
DNA ligase. Then the RNA strand is replaced by the second cDNA strand using E,coli RNase H. E.coit 
DNA polymerase I, and E.cotI DNA ligase. This procedure is shown in Figure 4. The obtained ligation 
mixture containing cDNA-harboring plasmid is used to transform a host E,cotl containing a f' factor. An 

70 E.coli strain containing a f' factor Includes hosts such as NM522. JM101, JM103, JM105. JM109. and 
DH5aF'. Transformants constituting a cDNA library are cultured and amplified in an appropriate medium 
such as L-broth or 2xTY containing ampicillin if the vector has an ampicillin-resistant gene, and then are 
stocked in a deep freezer after adding glycerol or DMSO. If the culture of cDNA library is infected with a 
helper phage, each E.coli cell releases in a medium a phage containing a single-stranded cDNA derived 

15 from a cDNA vector. This single-stranded cDNA have an anti-sense strand for mRNA template, After 
Infection of helper phage to cDNA library and incubation, E.coli cell is harvested, washed thoroughly with 
medium to remove helper phage, and diluted with medium. This diluted cell solution is mixed with a soft 
agar or a soft agarose solution containing E.coli host not harboring cDNA vector, the mixture is poured onto 
agar plates. After incubation plaques can be seen where cDNA-harboring cells exist. E.coli cells picked up 

20 from a single plaque by a toothpick are cultured in a liquid medium such as 2xTY. A part of culture medium 
is stored in a freezer. After the remaining culture is centrifuged, the cell pellet is used to isolate plasmid and 
the supernatant is used to isolate phage from which a single-stranded cDNA is isolated by known methods. 
It is desirable that a part of supernatant containing phage is stored in a refrigerator. A helper phage used 
above includes phages such as M13K07 [J.Vieira and J.Messing, " Methods in Enzymology" vol.153, p.3, 

25 1987], VCS-M13. and R408 (both can be purchased from Stratagene). 

We can select an alternative procedure to isolate a single-stranded DNA. in which before phage 
infection a single colony is picked up, incubated, and followed by helper phage infection. 

Using above method, various types of cDNA libraries can be prepared, such as transformants 
containing cDNA vectors originated from a single species of mRNA, cDNA vectors as a double-stranded 

30 DNA, cDNA vectors as a single-stranded DNA, and phages containing a single-stranded DNA. 



(3) Determination method of nucleotide sequence 

35 Either a single-stranded DNA or a double-stranded DNA can be used. A single-stranded DNA is 
preferred because more nucleotide sequences can be read. The nucleotide sequence can be determined 
using a dideoxy method in which synthetic oligonucleotide of 17 mer to 20 mer corresponding to the 
nucleotide sequence just before the restriction site RE2 is used as a sequencing primer and substrates 
labeled with radioisotope or fluorescent dye are used. A sequencing reaction is carried out using enzymes 

40 such as Klenow fragment, Taq DNA polymerase, T7 DNA polymerase, and reverse transcriptase. Using 
pKAl, an universal primer can be used for Ml 3 sequencing, by which approximately 400 nucleotides from 
s'-terminal of cDNA can be determined. If cDNA is a full-length clone, an initiation codon followed by an 
open reading frame in the determined sequence may be found. If no stop codon appears within the open 
reading frame, it is necessary to determine further sequence, in which case new primer corresponding to 

45 the nucleotide sequence near the 3'-end of the determined sequence should be synthesized to be used for 
further sequencing. Using the plasmid of the present invention, the. derivatives deleted from a 5 terminal of 
a cDNA insert can be prepared by cutting the cDNA-harboring plasmid at restriction sites RE1 and RE6 and 
followed by exonuclease treatment according to the known deletion method. These derivatives can be used 
to determine the nucleotide sequence using a primer corresponding to the nucleotide sequence just before 

60 the restriction site RE6, which conresponds to the reverse primer for Ml 3 in the case of pKAl system. 

Since the obtained nucleotide sequence corresponds to the sequence of a sense strand which is the 
same as the sequence of mRNA, this sequence can be used to search similarity to known sequences in 
database: a DNA database for a nucleotide sequence and a protein database for an amino acid sequence of 
an open reading frame. If there is similarity to a known sequence, the function and characteristics of the 

66 protein encoded by the obtained cDNA can be estimated. 

(4) Screening Method 



6 
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When determination of sequences of all cDNA clones included in the library is desired, it Is desirable to 
remove the already sequenced clones from the library to avoid the overlapped sequencing of the same 
clone. One aspect of the invention provides a subtraction method satisfying above demand. This method 
also can be used to obtain cell-specific clones or stimulus-induced clones. 

An example that a cDNA library LA is subtracted by a cDNA library LB will now be described. This case 
corresponds that a library LA contains novel clones or specific clones, on the other hand, a library LB 
contains sequence-determined clones or common clones. The library LA is cultured and amplified, and total 
plasmids containing cDNA vectors are isolated, which are cut at the restriction site RES and then reacted 
with the RNA polymerase acting on a promoter site PR2. If the substrate nucleotide labeled with 
radioisotope or fluorescence dye is added in the reaction mixture, the labeled RNA corresponding to the 
sense strand for each cDNA is obtained. On the other hand, the library LB is cultured and infected by a 
helper phage to prepare a single-stranded DNA from the supernatant. When the labeled RNAs prepared 
from library LA and the single-stranded DNA prepared from library LB are mixed and hybridized, the sense 
RNA strand hybridizes with the corresponding anti-sense single-stranded DNA derived from the common 
cDNA clones existing in both libraries, but labeled RNAs derived from specific cDNA clones in a library LA 
remains to be free. If these hybridization mixture is used as a probe to screen a library LA by the methods 
of plaque hybridization or colony hybridization, specific clones existing only in a library LA can be obtained. 
The principle of the subtraction method described above is shown in Figure 5. 

Since the cDNA vector described in this invention has a promoter PR3 for an RNA polymerase after the 
3' terminal of cDNA, an RNA probe which consists of an anti-sense strand for cDNA can be prepared by 
cutting the cDNA vector at the restriction site RE1 and reacting with an RNA polymerase acting on PR3. 
The product can be used as a probe for Northern hybridization. 



(5) Expression in animal cells 

Since the cDNA vector described in this invention has a replication origin and promoter functioning In 
mammalian cells, this can be used to express in mammalian cells if the isolated CDNA-harboring plasmid is 
introduced into cells by using known methods such as a phosphate calcium method or an electroporatlon 
method. If the Introduced cDNA encodes a receptor protein or a secreted protein, this method can be used 
for an expression screening. 

Cutting at the restriction site RES and reacting it with CAP-like nucleotide substrate such as m^GpppG 
and RNA polymerase acting on PR2, RNA which is available in in vitro or Xenopus oocyte translation 
systems can be obtained. 

The present invention will now be described by way of examples, but various changes and modifica- 
tions can be made without departing from the invention. 

The basic procedure of DNA recombination and the reaction conditions were in accordance with the 
literature [T.Maniatis et al. (1982), "IVIolecular Cloning. A Laboratory Manual", Cold Spring Harbor 
Laboratory]. 

Example 1 



Construction of Cloning Vector pKA1 



(1) Preparation of pTZ18RP1 (Figure 6) 

Ten ug of the plasmid pTZ18RP1 purchased from Phanmacia was digested by 100 units of BamHI and 
100 units of EcoRI. The reaction mixture was fractionated with 0.8% agarose gel electrophoresis (AGE), and 
then a DNA fragment of 2.9 kbp was isolated from the gel. Ten pmoles of synthetic oligomers L1 and L2. 
which contain the restriction enzyme sites EcoRI, BStX!. EcoRV, KpnI. Notl. and BamHI. were annealed and 
ligated to above DNA fragment. . 

LI s'-AATTCCAGGGGGGTGGTGGATATCCTGGTACCGCGGCCGCG- 3 

L2 3'-GGTCCCCCCACCACCTATAGGACCATGGCGCCGGCGCCTAG-5' E.coli HB101 was transformed 
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with the ligation mixture and plasnnids were isolated from several transformants. After tfie insertion site of 
synttietic oligomer in piasmids was sequenced using a reverse primer for l\/)13 sequencing, the plasmid 
possessing the designed sequence was denoted by pT218RP1. 

5 

(2) Preparation of pTZ18RP2 (Figure 6) 

The SnaBI site was introduced before a T7 promoter using site-directed mutagenesis. Transformant of 
E.co// CJ236 (purchased from BioRad) by the plasmid pT218RP1 prepared in section (1) was cultured in 5 

TO ml of 2xTY medium containing 50 ug/ml of ampicillin at 37 ' C for one hour. Then a helper phage M13K07 
(purchased from Pharmacia) solution was added and inoculated at 37 ' C overnight A single-stranded DNA 
(ssDNA) was isolated from the culture medium. The ssDNA annealed with 10 pmoles of synthetic oligomer 
L3 in which 5 terminal was phosphorylated by T4 polynucleotide l<inase was mixed with 4 units of Klenow 
fragment, 350 units of T4 DNA llgase, and 0,8 mM of dNTPs followed by incubation at 4 " C for 5 minutes, 

15 at room temperature for 5 minutes, and finally at 37 * C for 2 hours. 
L3 5'-CCATGATTACQTATTTAATACGA-3' 

Exoli JM109 was transformed with the ligation mixture and piasmids were isolated from several 
transformants. The plasmid which can be cut by SnaBI and possesses the designed sequence was denoted 
by pTZ18RP2. 

20 

(3) Preparation of pTZ18RP3 (Rgure 6) 

A T3 promoter was introduced to the BamHI site in pTZ18RP2. Ten ug of pTZ18RP2 prepared in 
25 section (2) was digested by 100 units of BamHI. After the reaction mixture was fractionated with 0.8% 
agarose gel electrophoresis, a DNA fragment was isolated from the gel. Ten pmoles of synthetic oligomers 
L4 and L5 containing a T3 promoter were annealed and ligated to above DNA fragment 
L4 5'-GATCTCCCTTTAGTGAGGGTTATTG-3' ^ 

L5 3' -AGGGAAATCACTCCCAATAACCTAG-s' E.co// NM522 was transformed with the ligation mixture 
30 and piasmids were isolated from several transformants. The plasmid possessing the designed sequence 
was denoted by pTZ18RP3. 



35 



(4) Preparation of pTZ18RP4 (Figure 6) 



The Xhol site and universal primer sequence (designated by U) for Ml 3 sequencing was introduced to 
the SnaBI site in pTZ18RP3. Ten ug of pTZ18RP3 prepared In section (3) was digested by 100 units of 
SnaBI. After the reaction mixture was fractionated with 0.8% agarose gel electrophoresis, a DNA fragment 
was isolated from the gel. Ten pmoles of synthetic oligomers L6 and L7 containing a Xhol site and universal 
40 primer sequence (designated by U) for Ml 3 sequencinp were annealed and ligated to above DNA fragment. 
L6 s'jGCGTCGAQTGTAAAACGAGQGCCAGTAC- 3 ^ 

L7 3'-CGGAGCTCACATTTTGCTGCCGGTGATG- 5' E.coll NM522 (purchased from Stratagene) was 
transformed with the ligation mixture and piasmids were isolated from several transformants. The plasmid 
possessing the designed sequence was denoted by pTZ18RP4. 

45 

(5) Preparation of pTZ19UD1 (Rgure 7) 

A reverse primer sequence (designated by R) for Ml 3 sequencing was Introduced between Bgll and 
50 BamHI sites in the plasmid pTZ19U. Two ug of pTZ19U (purchased from Toyobo) was partially digested by 
20 units of Bgll at 37 * C for 15 minutes. A fragment of 2.8 kbp cut at one site was isolated from the gel 
after fractionated with 1.2% agarose gel electrophoresis. After this fragment was digested by 100 units of 
BamHI, a fragment of 2.7 kbp was isolated with 1 .2% agarose gel electrophoresis. Ten pmoles of synthetic 
oligomers L8 and L9 containing a reverse primer sequence (designated by R) for Ml 3 sequencing were 
55 annealed and ligated to the BamHI-Bgll fragment of pTZ19Lf. 
L8 5';ACAGAGGAAACAGCTATGACCAT-3' 

L9 3'-AAGTGTGTCCTTTGTCGATACTGGTACTAG-5' E.co// JM109 was transfonned with the ligation 
mixture and piasmids were Isolated from several transformants. The plasmid possessing the designed 



8 



. EP 0 426 455 A2 



sequence was denoted by pTZ19UD1. 



(6) Preparation of pKAO (Figure 7) 

Ten ug of pTZ19UD1 prepared in section (5) was digested by 100 units of Bgll and 100 units of Hindlll. 
After fractionated witli 1.2 % agarose gel electrophoresis, a 1.4 l<bp fragment containing a f1 origin was 
isolated from the gel. Ten ixg of pLI (purchased from Pharmacia) was digested by 100 units of Hindlll and 
100 units of BamHI. After fractionated with 1 .8 % agarose gel electrophoresis, a 420 bp fragment containing 
an origin and early promoter of SV40 was isolated from the gel. Ten ug of pcDV1 (purchased from 
Pharmacia) was digested by 100 units of BamHI and 100 units of BgIL After fractionated with 0.8 % agarose 
gel electrophoresis, a 1 .6 i<bp fragment containing a poly A addition signal sequence was isolated from the 
gel. These three fragments were ligated and then E.coli HB101 was transformed with the ligation mixture. 
Plasmids were isolated from several transformants and digested by BamHI, Bgll and Hindlll to confirm that 
the restriction map of the obtained plasmid agrees with that of pKAO. 



(7) Preparation of pKAIa (Figure 8) 

After 10 ug of pTZ18RP4 prepared in section (4) was digested by 100 units of BamHI and 100 units of 
Xhol. a fragment of 117 bp was isolated with 1.8 % agarose gel electrophoresis. On the other hand, after 10 
ixg of pKAO prepared in section (6) was digested by 100 units of BamHI and 100 units of Xhol. a fragment 
of 3.3 l<bp was isolated with 0.8 % agarose gel electrophoresis. Both fragments were ligated and then E.coli 
HB101 was transformed with the ligation mixture. Plasmids were isolated from several transformants and 
digested by BamHI and Xhol to confirm that the restriction map of the obtained plasmid agrees with that of 
pKA1a. 



(8) Preparation of pKAlb (Figure 8) 

Ten ug of pLI was digested by 100 units of Pstl, and the terminals were blunted by T4 DNA 
polymerase. After this fragment was digested by Hindlll. a 500 bp fragment containing a SV40 promoter 
and 16S splicing site was isolated with 1.8% agarose gel electrophoresis. 

One ug of PkAia was digested by 100 units of Xhol, and the terminals were blunted by Klenow 
fragment After this fragment was digested by Hindlll, a fragment of 3 kbp was isolated with 0.8% agarose 
gel electrophoresis. 

Both fragments were ligated and then E.coli HB101 was transformed with the ligation mixture. Plasmids 
were isolated from several transformants and digested by Hindlll and BamHI to confirm that the restriction 
map of the obtained plasmid agrees with that of pKAIb. 

(9) Preparation of pKAl (Figure 8) 

After 10 ug of pKA1b was digested by 100 units of Hindlll, a fragment of 3.6 kbp was isolated with 1.8 
% agarose gel electrophoresis. Ten pmoles of synthetic oligomers L10 and L11 containing a restriction site 
Sfil were annealed and ligated to the above fragment. 
L10 5'-AGCTAG6CCTACATGGGGA-3' 

L11 3-TCCGGATGTAGCGGTTCGA-5' E.coli HB101 was transformed with the ligation mixture and 
plasmids were isolated from several transformants. After the insertion site of oligomers was sequenced 
using a reverse primer for M13 sequencing, the plasmid possessing the designed sequence, in which 
the order of restriction sites is Sphl-Sfil-Hindlll. was denoted by pKA1. The details of restriction map of 
pKAl has been already shown in Figure 3. 



EXAMPLE 2 
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Preparation OF cDNA Library 



(1) Preparation of vector-primer (Rgure 9) 

After 100 ug of pKA1 was digested by 200 units of Kpni, a fragment was isolated witli 1.8 % agarose 
gel electrophoresis. A fragment of 70 iiQ was incubated at 37 * C for 30 minutes in the reaction mixture 
containing 20 uM of dTTP and 375 units of terminal deoxynucleotldyl transferase (purchased from Takara 
Shuzo). The number of tailed dT was estimated to be approximately 60, which was calculated from the 
uptalce of [a-32p]dTTP added to a part of the reaction mixture. After digested with EcoRV and fractionated 
with 0.8% agarose gel electrophoresis, a longer fragment was isolated and used as a vector-primer. 

(2) Preparation of poly(A)*RNA 

Total mRNA was Isolated using a guanidinium/thlocyanate method from the cell culture of human cell 
lines such as HUT-78, HUT-102, U937, and HT-1080. Poly(A)*RNA was purified from this mRNA using an 
oligo dT column. 



(3) Synthesis of cDNA 

Total process of cDiMA synthesis is shown in Figure 4. The reaction condition described in the protocol 
of the Okayama-Berg method was used. Enzymes were purchased from Takara Shuzo. Briefly, after 
annealing 3 ug of poly(A)*RNA prepared in section (2) with 1.5 ug of vector-primer prepared in section (1). 
reverse transcriptase was added to synthesize the first cDNA strand. The reaction mixture was extracted 
with phenol, and then cDNA was precipitated by adding ethanol. The precipitate was dissolved in the 
reaction solution containing 1 uM of dCTP and reacted with 15 units of terminal deoxynucleotldyl 
transferase to add a dC tail. After phenol extraction and ethanol precipitation of the reaction mixture, the 
product was digested with 50 units of BstXI (purchased from New England Biolabs). After phenol extraction 
and ethanol precipitation of the reaction mixture, the product was annealed and self-ligated using E.coli 
DMA ligase. Then adding dNTPs including dATP, dGTP, dCTP, and dTTP, E.coli DNA polymerase I and 
E.coll RNase H, the RNA strands were replaced by the DNA strands. 



(4) Preparation of cDNA library 

E.coli NM522 was transformed with the solution containing cDNA vectors prepared in section (3). 
Transformants were suspended in 2xTY medium containing 100 ug/ml of ampicillin and incubated at 37 ' C 
overnight. A part of transfonmant was spread on an agar plate and Incubated overnight. The number of 
independent clones including in the library was estimated by counting the number of colonies appeared on 
the agar plate to be approximately 2 x 10^ - 10^ A part of culture was infected by helper phage M13K07 
(purchased from Pharmacia) and incubated at 37 * C overnight after adding kanamycin to 70 ug/ml. A part 
of the resulting culture was stocked at -80 * C as a 15% glycerol solution. 



(5) Isolation of cDNA vector plasmid 

The E.coli culture consisting of the cDNA library prepared in section (4) was harvested by centrifuga- 
tion. The resulting pellet was washed thoroughly with 2xTY medium, mixed with a culture of E.coli NM522 
and a soft agar medium kept warm at 37 ' C. and poured onto 2 x TY agar plates. After incubation at 37 * C 
overnight, a single plaque appeared on plates was isolated, suspended in 2 x TY medium, and inoculated at 
37 *C overnight. The culture was separated to the cell pellet and the supernatant by centrifugation. A 
double-stranded plasmid was isolated from the cell pellet, and a single-stranded DNA from the supernatant. 
The overall cultures were stocked at -80 *C as a 15 % glycerol solution, the double-stranded plasmid and 
the single-stranded DNA were stocked at -20 ' C as a TE solution, and the culture supernatant was stocked 
at 4 ' C in a refrigerator. 
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EXAMPLE 3 



Preparation of Human cDNA Bank 



(1) Size determination of human cDNA insert 

The double-stranded plasmids isolated in section (5) of Example 2 were digested with EcoRI and Not!. 
The size of the resulting cDNA fragment was determined with 0.8 % agarose gel electrophoresis. 



(2) Determination of nucleotide sequence 

The determination of the nucleotide sequence of a single-stranded cDNA prepared in section (5) of 
Example 2 was carried out using a DNA sequencer (purchased from Applied Biosystems). The sequencing 
reaction was performed by the dideoxy method using a fluorescence-labeled universal primer for M13 
sequencing and Taq polymerase. This method could provide the nucleotide sequence of approximately 400 
bp from 5 end of cDNA. We found the 4 - 10 tails of dG at the s' end of cDNA. The obtained nucleotide 
sequences were filed in the disk of computer as the "Homo-Protein cDNA database". 



(3) Analysis of nucleotide sequence 

The filed nucleotide sequences were analyzed using the computer software including the genetic 
information analysis system GENIAS and the protein information analysis system PRINAS (both purchased 
from Mitsui Knowledge Industry). First, the homology search was carried out using each nucleotide 
sequence and the DNA database "GenBank"". Then whether an initiation codon and an open reading frame 
exists in the amino acid sequence converted from the nucleotide sequence was investigated. If an open 
reading frame exists, the homology search of the amino acid sequence was can-ied out using the protein 
database "PRINAS". 

Using above method a cDNA bank, that is, the group of cDNA vectors in which the size of cDNA insert, 
the nucleotide sequence from 5 end, and the amino acid sequence encoded by the open reading frame 
were determined, was constructed. So far 20 % of analyzed clones had an open reading frame which 
sequence has a similarity with the amino acid sequence of a known protein In the database. Most of them 
are full-length clones. The obtained clones includes a wide variety of proteins such as metabolic enzymes, 
proteins regarding with DNA replication or protein synthesis, secreted proteins, and an extra-cellular matrix. 
Table 1 shows the examples of clones encoding proteins which have a similarity with a known protein. 



EXAMPLE 4 



Plaque Hybridization Using RNA Probe 



(1) Preparation of RNA probe 

An RNA probe was prepared using a kit purchased from Stratagene. Briefly, after the double-stranded 
cDNA vector plasmid was cut with Not!, the product was reacted in the reaction mixture containing 50 uCi 
of [a-32p] CTP and T7 RNA polymerase to synthesis an RNA probe corresponding to the sense strand of 
cDNA. 
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(2) Plaque Hybridization 

A cDNA plaque library was formed on agar plates using the method described in section (5) of Example 
2. Dry nitrocellulose filters were placed on the surface of agar plates to transfer phages existing around the 

5 plaque. In the same way. two replica filters were prepared. The filters were air-dried, and baked at 80 *C 
for 3 hours. The filters were preincubated at 37 ' C for 2 hours in the solution containing 50 % formamide, 5 
X SSC. 10 X Denhardfs solution, and 0.1% SDS. After adding the RNA probe prepared in section (1), the 
incubation was sustained at 37 *C for 16 hours. The filters were washed with the solution containing 0.2 x 
SSC and 0.1% SDS at 65 * C for 1 hour, and air-dried. The resulting filters were subjected to autoradiog- 

10 raphy. Figure 10 (A) shows the result of a model experiment using a plasmid vector containing an actin 
cDNA. We can see positive signals where plaques exist. 



75 



45 



50 



(3) Subtraction method 



An RNA probe and a single-stranded DNA were prepared from the clone containing an actin cDNA, and 
prehybridized. The resulting mixture was used as a probe for hybridization with a filter spotted with phage 
containing a single-stranded actin cDNA. As shown in Figure 10 (B), we can see no positive signal on the 
autoradiogram. This means that a single-stranded DNA can inhibit completely the probe-function of a 
20 corresponding RNA by prehybridization, thus that the subtraction method shown in Figure 5 is possible. 



EXAMPLE 5 

25 

Determination of Whole Nucleotide Sequence of cDNA Insert Using the Deletion Method 

30 The plasmid pKATNFI was isolated from the clone No.16 shown in Table 1 which contains the cDNA 
encoding TNF (Tumor Necrosis Factor). Fifteen ug of this plasmid was digested with 20 units of SphI and 
then 15 units of SnaBI. After treated with exonuclease III at 37 'C , the sampling was carried out at 1 
minute interval. These samples were mixed, treated with mung-bean nuclease and Klenow fragment, and 
then self-ligated to obtain plasmids deleted from 5 tenminai of the cDNA insert. Above reaction was carried 

35 out using a kilo-deletion kit purchased from Takara Shuzo. The single-stranded DNA was prepared from 
each deleted clone using the method described in section (4) and (5) of Example 2, and used to determine 
its nucleotide sequence using the method described in section (2) of Example 3, where a reverse primer for 
M13 sequencing was used as a sequencing primer. As a result, plasmids lacking 150 bp, 300 bp, 600 bp. 
850 bp. 1050 bp, and 1300 bp from 5' terminal were obtained. The sequences from 5 terminal of these 

40 clones were determined and overlapped to obtain the sequence covering a whole region of cDNA. The 
obtained sequence agreed completely with the reported sequence of TNF. 



EXAMPLE 6 



Expression of cDNA in Mammalian Cell 



The pKATNFI which sequence was determined in Example 5 was introduced into C0S7 cells (obtained 
from ATCC), and its culture supernatant was tested whether it shows a TNF activity. The isolated plasmid 
pKATNFI was transfected into 5x10^ cells of C0S7 using an eletroporation method. After the trasnfected 
C0S7 cells were inoculated in MEM medium for 3 days, the culture supernatant was subjected to a TNF 
55 assay using mouse L929 ceils (obtained from ATCC). Consequently the supernatant showed a TNF activity 
of 2.9 X 103 units/ml. This means that the SV40 promoter on the vector acts as expected. 
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Table 1 



Examples of clone containing in the cDNA bank. 


Clone 


Known protein showing similarity 


number 




1 


HMG14 


2 


HMG17 


3 


Ribosomai protein SI 4 


4 


Ribosomai protein SI 7 


5 


Ribosomai phosphoprotein PI 


6 


Elongation factor la 


7 


Initiation facto 2o 


8 


Glyceraldehyde-3-phosphate dehydrogenase 


9 


Pyruvate kinase 


10 


/8-actin 


11 


Profilin 


12 


Heat shock protein 90kDa 


13 


Gaiactosidase A 


14 


l_aminin binding protein 


15 


Plasminogen activator Inhibitor 2 


16 


TNF (Tumor Necrosis Factor) 


17 


Mitochondria cytochrome b 



Claims 

1. A cloning vector plasmid comprising, in the order mentioned from upstream to downstream region unless 
otherwise specified, a promoter PR1 acting In mammalian cells; an unique restriction enzyme site RE1 
rarely existing in eukaryotic DNA and a promoter PR2 of RNA polymerase; an unique restriction enzyme 
site RE2 generating 3 -protruding dG tail; an unique arbitrary restriction enzyme site RE3; an unique 
arbitrary restriction enzyme site RE4 generating 3'-protruding terminus; an unique restriction enzyme site 
RES rarely existing in eukaryotic DNA and a promoter PR3 of RNA polymerase; and at least one kind of 
arbitrary restriction enzyme site RES generating 3'-protruding terminus existing at the arbitrary position 
before the restriction site RE1 ; and further, at arbitrary positions other than the above-described restriction 
enzyme sites and promoters, an origin 0R1 for replication in mammalian cells, an origin 0R2 for replication 
of a single-stranded phage, an origin 0R3 for replication in Eco//, and a selective marker. 

2. The plasmid of claim 1 , which Is a cloning vector plasmid pKAl . 

3. A vector-primer which is prepared by cutting said plasmid of claim 1 at the restriction site RE4; adding 
thereto dT-tail by terminal deoxynucleotldyl transferase and then cutting the resultant at the restriction site 
RE3. 

4. The vector-primer of claim 3, wherein the plasmid of claim 2 Is cut with restriction enzyme KpnI. the dT 
tall is added thereto with terminal deoxynucleotldyl transferase and then the resultant Is cut with restriction 
enzyme EcoRV. 

5. A method for preparing a cDNA bank including cDNA vectors which contains a sequence-determined 
cDNA insert and can be expressed in mammalian cells, which comprises synthesizing cDNAs from poly(A)- 
*RNA isolated from cells using a vector-primer of claim 3 or claim 4, transforming E.coli with vectors 
containing said cDNAs, preparing a cDNA plaque library by infection of helper phage, and sequencing 
cDNA from 5 terminal end using a single-stranded DNA isolated from a single-stranded phage plaque. 

6. A method for preparing a library-specific cDNA bank including cDNA vectors which contains a sequence- 
determined cDNA insert and can be expressed in mammalian ceils, which comprises preparing two kinds of 
cDNA library LA and LB using the method of claim 5, preparing sense RNA probes from the library LA 
using RNA polymerase, preparing a group of anti-sense single-stranded cDNAs from the library LB, 
obtaining the library LA-specific cDNA clones by screening the library LA using probes which contain a 
hybridization mixture between RNA probes and said single-stranded cDNAs, and sequencing said library 
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LA-specific cDNA from S terminal. 
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